Dataset statistics
| Number of variables | 30 |
|---|---|
| Number of observations | 990 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 239.8 KiB |
| Average record size in memory | 248.0 B |
Variable types
| Numeric | 6 |
|---|---|
| Categorical | 24 |
Alerts
| Alert | Type |
|---|---|
| ECOG is highly overall correlated with ECOG_encoded | High correlation |
| ECOG_encoded is highly overall correlated with ECOG | High correlation |
| OS is highly overall correlated with PFS and 3 other fields | High correlation |
| PFS is highly overall correlated with OS and 6 other fields | High correlation |
| age is highly overall correlated with age_encoded | High correlation |
| age_encoded is highly overall correlated with age | High correlation |
| best_response is highly overall correlated with PFS and 5 other fields | High correlation |
| best_response_encoded is highly overall correlated with PFS and 5 other fields | High correlation |
| clin_benefit=Yes is highly overall correlated with OS and 6 other fields | High correlation |
| msi_type is highly overall correlated with msi_type_encoded | High correlation |
| msi_type_encoded is highly overall correlated with msi_type | High correlation |
| progression=Yes is highly overall correlated with PFS and 5 other fields | High correlation |
| response=Yes is highly overall correlated with OS and 5 other fields | High correlation |
| stage is highly overall correlated with stage_encoded | High correlation |
| stage_encoded is highly overall correlated with stage | High correlation |
| tx_line is highly overall correlated with tx_line_encoded | High correlation |
| tx_line_encoded is highly overall correlated with tx_line | High correlation |
| vital_status is highly overall correlated with OS and 5 other fields | High correlation |
| msi_type is highly imbalanced (71.8%) | Imbalance |
| stage is highly imbalanced (81.9%) | Imbalance |
| cancer_type=LGI is highly imbalanced (67.4%) | Imbalance |
| drug_class=ctla-4 is highly imbalanced (95.4%) | Imbalance |
| stage_encoded is highly imbalanced (81.9%) | Imbalance |
| msi_type_encoded is highly imbalanced (71.8%) | Imbalance |
| id has unique values | Unique |
| tmb_mutations_mb has 22 (2.2%) zeros | Zeros |
| age_encoded has 16 (1.6%) zeros | Zeros |
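The imbalance percentages reported above can be reproduced from the category counts listed later in this report. The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalised by the maximum entropy for that number of classes. This formula is an assumption about how the profiler computes the figure, but it matches the reported values for `msi_type`, `stage`, and `cancer_type=LGI`. The helper name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean that
    one class dominates. Assumed formula, not taken from the report.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts copied from the Common Values tables of this report.
examples = {
    "msi_type": [918, 41, 31],        # Stable, Indeterminate, Unstable
    "stage": [930, 54, 5, 1],         # IV, III, II, I
    "cancer_type=LGI": [931, 59],     # 0.0, 1.0
}

for name, counts in examples.items():
    print(f"{name}: {imbalance_score(counts):.1%}")
```

Under this assumption the three variables come out at 71.8%, 81.9%, and 67.4%, the same figures as in the alerts.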
Reproduction
| Analysis started | 2024-03-10 19:35:27.520662 |
|---|---|
| Analysis finished | 2024-03-10 19:35:30.493799 |
| Duration | 2.97 seconds |
| Software version | ydata-profiling v4.6.4 |
| Download configuration | config.json |
id
Real number (ℝ)
UNIQUE 
| Distinct | 990 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9153.2323 |
| Minimum | 8215 |
| Maximum | 10289 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.5 KiB |
Quantile statistics
| Minimum | 8215 |
|---|---|
| 5-th percentile | 8298.45 |
| Q1 | 8635.5 |
| median | 9151.5 |
| Q3 | 9670.75 |
| 95-th percentile | 9971.1 |
| Maximum | 10289 |
| Range | 2074 |
| Interquartile range (IQR) | 1035.25 |
Descriptive statistics
| Standard deviation | 569.91101 |
|---|---|
| Coefficient of variation (CV) | 0.062263361 |
| Kurtosis | -1.3572457 |
| Mean | 9153.2323 |
| Median Absolute Deviation (MAD) | 519 |
| Skewness | -0.017417286 |
| Sum | 9061700 |
| Variance | 324798.56 |
| Monotonicity | Not monotonic |
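Several of the statistics for `id` are derived from one another, so the tables can be sanity-checked with simple arithmetic. The sketch below copies the reported values and recomputes the range, interquartile range, coefficient of variation, variance, and sum. The tolerances are assumptions chosen to absorb the rounding in the printed figures.

```python
import math

# Values copied from the `id` tables above.
n = 990
mean = 9153.2323
std = 569.91101
minimum, maximum = 8215, 10289
q1, q3 = 8635.5, 9670.75

value_range = maximum - minimum   # reported Range: 2074
iqr = q3 - q1                     # reported IQR: 1035.25
cv = std / mean                   # reported CV: 0.062263361
variance = std ** 2               # reported Variance: 324798.56
total = mean * n                  # reported Sum: 9061700

checks = {
    "range": value_range == 2074,
    "iqr": math.isclose(iqr, 1035.25),
    "cv": math.isclose(cv, 0.062263361, abs_tol=1e-6),
    "variance": math.isclose(variance, 324798.56, abs_tol=0.01),
    "sum": math.isclose(total, 9061700, abs_tol=1.0),
}
for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```

The same checks can be applied to the other numeric variables by substituting their reported values.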
Common values
| Value | Count | Frequency (%) |
| 8215 | 1 | 0.1% |
| 9510 | 1 | 0.1% |
| 9514 | 1 | 0.1% |
| 9517 | 1 | 0.1% |
| 9518 | 1 | 0.1% |
| 9519 | 1 | 0.1% |
| 9522 | 1 | 0.1% |
| 9523 | 1 | 0.1% |
| 9526 | 1 | 0.1% |
| 9527 | 1 | 0.1% |
| Other values (980) | 980 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 8215 | 1 | |
| 8216 | 1 | |
| 8217 | 1 | |
| 8219 | 1 | |
| 8221 | 1 | |
| 8222 | 1 | |
| 8223 | 1 | |
| 8226 | 1 | |
| 8229 | 1 | |
| 8230 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 10289 | 1 | |
| 10276 | 1 | |
| 10255 | 1 | |
| 10254 | 1 | |
| 10251 | 1 | |
| 10246 | 1 | |
| 10245 | 1 | |
| 10239 | 1 | |
| 10238 | 1 | |
| 10032 | 1 |
tx_year
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3960 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2018 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2018 |
| 4th row | 2017 |
| 5th row | 2016 |
Common Values
| Value | Count | Frequency (%) |
| 2017 | 356 | 36.0% |
| 2018 | 353 | 35.7% |
| 2016 | 210 | 21.2% |
| 2015 | 71 | 7.2% |
Words
| Value | Count | Frequency (%) |
| 2017 | 356 | |
| 2018 | 353 | |
| 2016 | 210 | |
| 2015 | 71 | 7.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 990 | |
| 0 | 990 | |
| 1 | 990 | |
| 7 | 356 | 9.0% |
| 8 | 353 | 8.9% |
| 6 | 210 | 5.3% |
| 5 | 71 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3960 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 990 | |
| 0 | 990 | |
| 1 | 990 | |
| 7 | 356 | 9.0% |
| 8 | 353 | 8.9% |
| 6 | 210 | 5.3% |
| 5 | 71 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3960 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 990 | |
| 0 | 990 | |
| 1 | 990 | |
| 7 | 356 | 9.0% |
| 8 | 353 | 8.9% |
| 6 | 210 | 5.3% |
| 5 | 71 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3960 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 990 | |
| 0 | 990 | |
| 1 | 990 | |
| 7 | 356 | 9.0% |
| 8 | 353 | 8.9% |
| 6 | 210 | 5.3% |
| 5 | 71 | 1.8% |
age
Categorical
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 6930 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 3 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 61 - 70 |
|---|---|
| 2nd row | 51 - 60 |
| 3rd row | 51 - 60 |
| 4th row | 51 - 60 |
| 5th row | 61 - 70 |
Common Values
| Value | Count | Frequency (%) |
| 61 - 70 | 346 | 34.9% |
| 51 - 60 | 241 | 24.3% |
| 71 - 95 | 230 | 23.2% |
| 41 - 50 | 109 | 11.0% |
| 31 - 40 | 48 | 4.8% |
| 21 - 30 | 16 | 1.6% |
Words
| Value | Count | Frequency (%) |
| (empty) | 990 | 33.3% |
| 61 | 346 | 11.6% |
| 70 | 346 | 11.6% |
| 51 | 241 | 8.1% |
| 60 | 241 | 8.1% |
| 71 | 230 | 7.7% |
| 95 | 230 | 7.7% |
| 41 | 109 | 3.7% |
| 50 | 109 | 3.7% |
| 31 | 48 | 1.6% |
| Other values (3) | 80 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1980 | |
| 1 | 990 | |
| - | 990 | |
| 0 | 760 | 11.0% |
| 6 | 587 | 8.5% |
| 5 | 580 | 8.4% |
| 7 | 576 | 8.3% |
| 9 | 230 | 3.3% |
| 4 | 157 | 2.3% |
| 3 | 64 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3960 | |
| Space Separator | 1980 | |
| Dash Punctuation | 990 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 990 | |
| 0 | 760 | |
| 6 | 587 | |
| 5 | 580 | |
| 7 | 576 | |
| 9 | 230 | 5.8% |
| 4 | 157 | 4.0% |
| 3 | 64 | 1.6% |
| 2 | 16 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1980 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6930 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 1980 | |
| 1 | 990 | |
| - | 990 | |
| 0 | 760 | 11.0% |
| 6 | 587 | 8.5% |
| 5 | 580 | 8.4% |
| 7 | 576 | 8.3% |
| 9 | 230 | 3.3% |
| 4 | 157 | 2.3% |
| 3 | 64 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6930 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1980 | |
| 1 | 990 | |
| - | 990 | |
| 0 | 760 | 11.0% |
| 6 | 587 | 8.5% |
| 5 | 580 | 8.4% |
| 7 | 576 | 8.3% |
| 9 | 230 | 3.3% |
| 4 | 157 | 2.3% |
| 3 | 64 | 0.9% |
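The character-level tables for `age` follow directly from its category counts. As a minimal sketch, the counts from the Common Values table can be expanded into per-character and per-Unicode-category totals with the standard library. The variable names here are illustrative and not taken from the profiler.

```python
from collections import Counter
import unicodedata

# Category counts copied from the Common Values table for `age`.
value_counts = {
    "61 - 70": 346,
    "51 - 60": 241,
    "71 - 95": 230,
    "41 - 50": 109,
    "31 - 40": 48,
    "21 - 30": 16,
}

char_counts = Counter()
category_counts = Counter()
for value, n in value_counts.items():
    for ch in value:
        # Each character of a category label occurs once per row holding it.
        char_counts[ch] += n
        # General category codes: Nd = decimal number, Zs = space, Pd = dash.
        category_counts[unicodedata.category(ch)] += n

print("total characters:", sum(char_counts.values()))
print("distinct characters:", len(char_counts))
print("per character:", char_counts.most_common())
print("per category:", dict(category_counts))
```

This yields 6930 total characters, 11 distinct characters, and 1980 spaces, in line with the tables above.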
nlr
Real number (ℝ)
| Distinct | 520 |
|---|---|
| Distinct (%) | 52.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.4006869 |
| Minimum | 0.3 |
| Maximum | 87 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.5 KiB |
Quantile statistics
| Minimum | 0.3 |
|---|---|
| 5-th percentile | 1.53 |
| Q1 | 2.86 |
| median | 4.42 |
| Q3 | 7.13 |
| 95-th percentile | 16.714 |
| Maximum | 87 |
| Range | 86.7 |
| Interquartile range (IQR) | 4.27 |
Descriptive statistics
| Standard deviation | 7.0841867 |
|---|---|
| Coefficient of variation (CV) | 1.1067854 |
| Kurtosis | 36.310545 |
| Mean | 6.4006869 |
| Median Absolute Deviation (MAD) | 1.825 |
| Skewness | 4.8687374 |
| Sum | 6336.68 |
| Variance | 50.185701 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 4 | 16 | 1.6% |
| 2 | 12 | 1.2% |
| 7 | 10 | 1.0% |
| 3 | 10 | 1.0% |
| 6 | 9 | 0.9% |
| 5 | 8 | 0.8% |
| 5.75 | 8 | 0.8% |
| 6.5 | 7 | 0.7% |
| 3.67 | 7 | 0.7% |
| 2.63 | 6 | 0.6% |
| Other values (510) | 897 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.3 | 1 | |
| 0.47 | 1 | |
| 0.65 | 1 | |
| 0.71 | 1 | |
| 0.74 | 1 | |
| 0.77 | 1 | |
| 0.79 | 1 | |
| 0.8 | 1 | |
| 0.95 | 1 | |
| 0.98 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 87 | 1 | |
| 77.33 | 1 | |
| 52 | 1 | |
| 51.75 | 1 | |
| 49.71 | 1 | |
| 49.5 | 1 | |
| 49.25 | 1 | |
| 43.67 | 1 | |
| 43.5 | 1 | |
| 37.5 | 1 |
msi_type
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.3525253 |
| Min length | 6 |
Characters and Unicode
| Total characters | 6289 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Stable |
|---|---|
| 2nd row | Stable |
| 3rd row | Stable |
| 4th row | Stable |
| 5th row | Indeterminate |
Common Values
| Value | Count | Frequency (%) |
| Stable | 918 | 92.7% |
| Indeterminate | 41 | 4.1% |
| Unstable | 31 | 3.1% |
Words
| Value | Count | Frequency (%) |
| stable | 918 | |
| indeterminate | 41 | 4.1% |
| unstable | 31 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1072 | |
| t | 1031 | |
| a | 990 | |
| b | 949 | |
| l | 949 | |
| S | 918 | |
| n | 113 | 1.8% |
| I | 41 | 0.7% |
| d | 41 | 0.7% |
| r | 41 | 0.7% |
| Other values (4) | 144 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5299 | |
| Uppercase Letter | 990 | 15.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1072 | |
| t | 1031 | |
| a | 990 | |
| b | 949 | |
| l | 949 | |
| n | 113 | 2.1% |
| d | 41 | 0.8% |
| r | 41 | 0.8% |
| m | 41 | 0.8% |
| i | 41 | 0.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 918 | |
| I | 41 | 4.1% |
| U | 31 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6289 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1072 | |
| t | 1031 | |
| a | 990 | |
| b | 949 | |
| l | 949 | |
| S | 918 | |
| n | 113 | 1.8% |
| I | 41 | 0.7% |
| d | 41 | 0.7% |
| r | 41 | 0.7% |
| Other values (4) | 144 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6289 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1072 | |
| t | 1031 | |
| a | 990 | |
| b | 949 | |
| l | 949 | |
| S | 918 | |
| n | 113 | 1.8% |
| I | 41 | 0.7% |
| d | 41 | 0.7% |
| r | 41 | 0.7% |
| Other values (4) | 144 | 2.3% |
tmb_mutations_mb
Real number (ℝ)
ZEROS 
| Distinct | 112 |
|---|---|
| Distinct (%) | 11.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.735758 |
| Minimum | 0 |
| Maximum | 368.6 |
| Zeros | 22 |
| Zeros (%) | 2.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3.5 |
| median | 6.1 |
| Q3 | 11.8 |
| 95-th percentile | 43.3 |
| Maximum | 368.6 |
| Range | 368.6 |
| Interquartile range (IQR) | 8.3 |
Descriptive statistics
| Standard deviation | 20.649885 |
|---|---|
| Coefficient of variation (CV) | 1.7595698 |
| Kurtosis | 103.20754 |
| Mean | 11.735758 |
| Median Absolute Deviation (MAD) | 3.5 |
| Skewness | 7.8953863 |
| Sum | 11618.4 |
| Variance | 426.41775 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 4.4 | 62 | 6.3% |
| 3.5 | 57 | 5.8% |
| 2.6 | 49 | 4.9% |
| 5.3 | 48 | 4.8% |
| 7.9 | 48 | 4.8% |
| 6.1 | 46 | 4.6% |
| 3.9 | 38 | 3.8% |
| 1.8 | 36 | 3.6% |
| 3 | 35 | 3.5% |
| 2 | 30 | 3.0% |
| Other values (102) | 541 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0 | 22 | 2.2% |
| 0.9 | 27 | |
| 1 | 18 | 1.8% |
| 1.8 | 36 | |
| 2 | 30 | |
| 2.6 | 49 | |
| 3 | 35 | |
| 3.5 | 57 | |
| 3.9 | 38 | |
| 4.4 | 62 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 368.6 | 1 | |
| 178.2 | 1 | |
| 158.9 | 1 | |
| 153.5 | 1 | |
| 144.8 | 1 | |
| 131.7 | 1 | |
| 111.5 | 1 | |
| 102.7 | 1 | |
| 102.3 | 1 | |
| 93.5 | 1 |
best_response
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1980 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PR |
|---|---|
| 2nd row | PD |
| 3rd row | PR |
| 4th row | PR |
| 5th row | PD |
Common Values
| Value | Count | Frequency (%) |
| PD | 519 | 52.4% |
| PR | 212 | 21.4% |
| SD | 199 | 20.1% |
| CR | 60 | 6.1% |
Words
| Value | Count | Frequency (%) |
| pd | 519 | |
| pr | 212 | |
| sd | 199 | 20.1% |
| cr | 60 | 6.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 731 | |
| D | 718 | |
| R | 272 | 13.7% |
| S | 199 | 10.1% |
| C | 60 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1980 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 731 | |
| D | 718 | |
| R | 272 | 13.7% |
| S | 199 | 10.1% |
| C | 60 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1980 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 731 | |
| D | 718 | |
| R | 272 | 13.7% |
| S | 199 | 10.1% |
| C | 60 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1980 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 731 | |
| D | 718 | |
| R | 272 | 13.7% |
| S | 199 | 10.1% |
| C | 60 | 3.0% |
PFS
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 395 |
|---|---|
| Distinct (%) | 39.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.7940202 |
| Minimum | 0.1 |
| Maximum | 52.44 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.5 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.59 |
| Q1 | 1.3575 |
| median | 2.6 |
| Q3 | 8.215 |
| 95-th percentile | 27.37 |
| Maximum | 52.44 |
| Range | 52.34 |
| Interquartile range (IQR) | 6.8575 |
Descriptive statistics
| Standard deviation | 9.3614444 |
|---|---|
| Coefficient of variation (CV) | 1.3778947 |
| Kurtosis | 5.7203496 |
| Mean | 6.7940202 |
| Median Absolute Deviation (MAD) | 1.71 |
| Skewness | 2.3525151 |
| Sum | 6726.08 |
| Variance | 87.636642 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 1.35 | 18 | 1.8% |
| 1.61 | 15 | 1.5% |
| 1.38 | 15 | 1.5% |
| 1.15 | 14 | 1.4% |
| 1.68 | 14 | 1.4% |
| 1.41 | 13 | 1.3% |
| 0.69 | 12 | 1.2% |
| 0.89 | 12 | 1.2% |
| 1.18 | 12 | 1.2% |
| 0.92 | 12 | 1.2% |
| Other values (385) | 853 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.1 | 1 | 0.1% |
| 0.13 | 2 | 0.2% |
| 0.2 | 4 | |
| 0.23 | 5 | |
| 0.26 | 3 | |
| 0.3 | 3 | |
| 0.33 | 1 | 0.1% |
| 0.36 | 3 | |
| 0.39 | 3 | |
| 0.43 | 2 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 52.44 | 1 | |
| 51.75 | 1 | |
| 50.23 | 1 | |
| 49.97 | 1 | |
| 46.88 | 1 | |
| 46.69 | 1 | |
| 46.46 | 1 | |
| 46.36 | 1 | |
| 45.96 | 1 | |
| 45.54 | 1 |
vital_status
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 610 | 61.6% |
| 0.0 | 380 | 38.4% |
Words
| Value | Count | Frequency (%) |
| 1.0 | 610 | |
| 0.0 | 380 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1370 | |
| . | 990 | |
| 1 | 610 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | |
| Other Punctuation | 990 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1370 | |
| 1 | 610 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1370 | |
| . | 990 | |
| 1 | 610 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1370 | |
| . | 990 | |
| 1 | 610 |
OS
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 606 |
|---|---|
| Distinct (%) | 61.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.378475 |
| Minimum | 0.1 |
| Maximum | 52.44 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.5 KiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.89 |
| Q1 | 4.1175 |
| median | 10.775 |
| Q3 | 19.55 |
| 95-th percentile | 36.815 |
| Maximum | 52.44 |
| Range | 52.34 |
| Interquartile range (IQR) | 15.4325 |
Descriptive statistics
| Standard deviation | 11.1894 |
|---|---|
| Coefficient of variation (CV) | 0.83637336 |
| Kurtosis | 0.73721163 |
| Mean | 13.378475 |
| Median Absolute Deviation (MAD) | 7.295 |
| Skewness | 1.0731896 |
| Sum | 13244.69 |
| Variance | 125.20267 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 3.88 | 6 | 0.6% |
| 2.69 | 5 | 0.5% |
| 1.74 | 5 | 0.5% |
| 7.95 | 5 | 0.5% |
| 0.95 | 5 | 0.5% |
| 0.99 | 5 | 0.5% |
| 14.95 | 5 | 0.5% |
| 3.35 | 4 | 0.4% |
| 0.92 | 4 | 0.4% |
| 0.85 | 4 | 0.4% |
| Other values (596) | 942 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.1 | 1 | 0.1% |
| 0.13 | 2 | |
| 0.2 | 1 | 0.1% |
| 0.23 | 3 | |
| 0.3 | 1 | 0.1% |
| 0.33 | 1 | 0.1% |
| 0.36 | 3 | |
| 0.39 | 2 | |
| 0.43 | 1 | 0.1% |
| 0.46 | 2 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 52.44 | 1 | |
| 52.04 | 1 | |
| 51.75 | 1 | |
| 50.14 | 1 | |
| 50.07 | 1 | |
| 49.97 | 1 | |
| 49.91 | 1 | |
| 48.82 | 1 | |
| 48.03 | 1 | |
| 47.47 | 1 |
stage
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.0535354 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2033 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | IV |
|---|---|
| 2nd row | IV |
| 3rd row | IV |
| 4th row | IV |
| 5th row | IV |
Common Values
| Value | Count | Frequency (%) |
| IV | 930 | 93.9% |
| III | 54 | 5.5% |
| II | 5 | 0.5% |
| I | 1 | 0.1% |
Words
| Value | Count | Frequency (%) |
| iv | 930 | |
| iii | 54 | 5.5% |
| ii | 5 | 0.5% |
| i | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1103 | |
| V | 930 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2033 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1103 | |
| V | 930 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2033 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 1103 | |
| V | 930 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2033 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 1103 | |
| V | 930 |
tx_line
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 13.368687 |
| Min length | 10 |
Characters and Unicode
| Total characters | 13235 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Subsequent-line |
|---|---|
| 2nd row | First-line |
| 3rd row | Subsequent-line |
| 4th row | First-line |
| 5th row | Subsequent-line |
Common Values
| Value | Count | Frequency (%) |
| Subsequent-line | 667 | 67.4% |
| First-line | 323 | 32.6% |
Words
| Value | Count | Frequency (%) |
| subsequent-line | 667 | |
| first-line | 323 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2324 | |
| n | 1657 | |
| u | 1334 | |
| i | 1313 | |
| s | 990 | |
| t | 990 | |
| - | 990 | |
| l | 990 | |
| S | 667 | 5.0% |
| b | 667 | 5.0% |
| Other values (3) | 1313 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11255 | |
| Dash Punctuation | 990 | 7.5% |
| Uppercase Letter | 990 | 7.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2324 | |
| n | 1657 | |
| u | 1334 | |
| i | 1313 | |
| s | 990 | |
| t | 990 | |
| l | 990 | |
| b | 667 | 5.9% |
| q | 667 | 5.9% |
| r | 323 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 667 | |
| F | 323 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12245 | |
| Common | 990 | 7.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2324 | |
| n | 1657 | |
| u | 1334 | |
| i | 1313 | |
| s | 990 | |
| t | 990 | |
| l | 990 | |
| S | 667 | 5.4% |
| b | 667 | 5.4% |
| q | 667 | 5.4% |
| Other values (2) | 646 | 5.3% |
Common
| Value | Count | Frequency (%) |
| - | 990 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13235 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2324 | |
| n | 1657 | |
| u | 1334 | |
| i | 1313 | |
| s | 990 | |
| t | 990 | |
| - | 990 | |
| l | 990 | |
| S | 667 | 5.0% |
| b | 667 | 5.0% |
| Other values (3) | 1313 |
ECOG
Categorical
HIGH CORRELATION 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 544 | 54.9% |
| 0.0 | 357 | 36.1% |
| 2.0 | 76 | 7.7% |
| 3.0 | 11 | 1.1% |
| 4.0 | 2 | 0.2% |
Words
| Value | Count | Frequency (%) |
| 1.0 | 544 | |
| 0.0 | 357 | |
| 2.0 | 76 | 7.7% |
| 3.0 | 11 | 1.1% |
| 4.0 | 2 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1347 | |
| . | 990 | |
| 1 | 544 | |
| 2 | 76 | 2.6% |
| 3 | 11 | 0.4% |
| 4 | 2 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | |
| Other Punctuation | 990 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1347 | |
| 1 | 544 | |
| 2 | 76 | 3.8% |
| 3 | 11 | 0.6% |
| 4 | 2 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1347 | |
| . | 990 | |
| 1 | 544 | |
| 2 | 76 | 2.6% |
| 3 | 11 | 0.4% |
| 4 | 2 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1347 | |
| . | 990 | |
| 1 | 544 | |
| 2 | 76 | 2.6% |
| 3 | 11 | 0.4% |
| 4 | 2 | 0.1% |
response=Yes
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 718 | 72.5% |
| 1.0 | 272 | 27.5% |
Words
| Value | Count | Frequency (%) |
| 0.0 | 718 | |
| 1.0 | 272 | 27.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1708 | |
| . | 990 | |
| 1 | 272 | 9.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | |
| Other Punctuation | 990 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1708 | |
| 1 | 272 | 13.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1708 | |
| . | 990 | |
| 1 | 272 | 9.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1708 | |
| . | 990 | |
| 1 | 272 | 9.2% |
clin_benefit=Yes
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 674 | 68.1% |
| 1.0 | 316 | 31.9% |
Words
| Value | Count | Frequency (%) |
| 0.0 | 674 | |
| 1.0 | 316 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1664 | |
| . | 990 | |
| 1 | 316 | 10.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | |
| Other Punctuation | 990 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1664 | |
| 1 | 316 | 16.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1664 | |
| . | 990 | |
| 1 | 316 | 10.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1664 | |
| . | 990 | |
| 1 | 316 | 10.6% |
cancer_type=GU
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 851 | 86.0% |
| 1.0 | 139 | 14.0% |
Words
| Value | Count | Frequency (%) |
| 0.0 | 851 | |
| 1.0 | 139 | 14.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1841 | |
| . | 990 | |
| 1 | 139 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | |
| Other Punctuation | 990 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1841 | |
| 1 | 139 | 7.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1841 | |
| . | 990 | |
| 1 | 139 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1841 | |
| . | 990 | |
| 1 | 139 | 4.7% |
cancer_type=LGI
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 931 | 94.0% |
| 1.0 | 59 | 6.0% |
Words
| Value | Count | Frequency (%) |
| 0.0 | 931 | |
| 1.0 | 59 | 6.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1921 | 64.7% |
| . | 990 | 33.3% |
| 1 | 59 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | 66.7% |
| Other Punctuation | 990 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1921 | 97.0% |
| 1 | 59 | 3.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1921 | 64.7% |
| . | 990 | 33.3% |
| 1 | 59 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1921 | 64.7% |
| . | 990 | 33.3% |
| 1 | 59 | 2.0% |
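
The IMBALANCE flag on cancer_type=LGI can be sanity-checked from the two counts in its Common Values table. The sketch below assumes the score is one minus the Shannon entropy of the class frequencies, normalized by log2 of the number of classes. That formula is an assumption about how the report derives the figure, and `imbalance_score` is a hypothetical helper name. Under that assumption the counts 931 and 59 give roughly 67.4%, in line with the imbalance alert the report lists for this variable.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of the class shares.

    Assumed formula: 0.0 means perfectly balanced classes, values near 1.0
    mean one class dominates. Not confirmed against the report generator.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Convention of this sketch only: a single class is scored as 0.0.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts copied from the Common Values table of cancer_type=LGI (0.0 -> 931, 1.0 -> 59).
lgi_score = imbalance_score([931, 59])
print(f"cancer_type=LGI imbalance: {lgi_score:.1%}")
```

The same helper applied to the drug_class=ctla-4 counts (985 and 5) gives a score close to 95.4%, so the assumed formula is at least consistent with both binary columns flagged in this report.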
cancer_type=Lung
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 534 | 53.9% |
| 1.0 | 456 | 46.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1524 | 51.3% |
| . | 990 | 33.3% |
| 1 | 456 | 15.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | 66.7% |
| Other Punctuation | 990 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1524 | 77.0% |
| 1 | 456 | 23.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1524 | 51.3% |
| . | 990 | 33.3% |
| 1 | 456 | 15.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1524 | 51.3% |
| . | 990 | 33.3% |
| 1 | 456 | 15.4% |
cancer_type=Melanoma
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 853 | 86.2% |
| 1.0 | 137 | 13.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1843 | 62.1% |
| . | 990 | 33.3% |
| 1 | 137 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | 66.7% |
| Other Punctuation | 990 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1843 | 93.1% |
| 1 | 137 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1843 | 62.1% |
| . | 990 | 33.3% |
| 1 | 137 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1843 | 62.1% |
| . | 990 | 33.3% |
| 1 | 137 | 4.6% |
cancer_type=UGI
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 810 | 81.8% |
| 1.0 | 180 | 18.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1800 | 60.6% |
| . | 990 | 33.3% |
| 1 | 180 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | 66.7% |
| Other Punctuation | 990 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1800 | 90.9% |
| 1 | 180 | 9.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1800 | 60.6% |
| . | 990 | 33.3% |
| 1 | 180 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1800 | 60.6% |
| . | 990 | 33.3% |
| 1 | 180 | 6.1% |
sex=Male
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 1.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 568 | 57.4% |
| 0.0 | 422 | 42.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1412 | 47.5% |
| . | 990 | 33.3% |
| 1 | 568 | 19.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | 66.7% |
| Other Punctuation | 990 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1412 | 71.3% |
| 1 | 568 | 28.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1412 | 47.5% |
| . | 990 | 33.3% |
| 1 | 568 | 19.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1412 | 47.5% |
| . | 990 | 33.3% |
| 1 | 568 | 19.1% |
drug_class=ctla-4
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 985 | 99.5% |
| 1.0 | 5 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1975 | 66.5% |
| . | 990 | 33.3% |
| 1 | 5 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | 66.7% |
| Other Punctuation | 990 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1975 | 99.7% |
| 1 | 5 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1975 | 66.5% |
| . | 990 | 33.3% |
| 1 | 5 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1975 | 66.5% |
| . | 990 | 33.3% |
| 1 | 5 | 0.2% |
drug_class=pd-1/pd-l1
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 807 | 81.5% |
| 0.0 | 183 | 18.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1173 | 39.5% |
| . | 990 | 33.3% |
| 1 | 807 | 27.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | 66.7% |
| Other Punctuation | 990 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1173 | 59.2% |
| 1 | 807 | 40.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1173 | 39.5% |
| . | 990 | 33.3% |
| 1 | 807 | 27.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1173 | 39.5% |
| . | 990 | 33.3% |
| 1 | 807 | 27.2% |
progression=Yes
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2970 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 835 | 84.3% |
| 0.0 | 155 | 15.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1145 | 38.6% |
| . | 990 | 33.3% |
| 1 | 835 | 28.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1980 | 66.7% |
| Other Punctuation | 990 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1145 | 57.8% |
| 1 | 835 | 42.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 990 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2970 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1145 | 38.6% |
| . | 990 | 33.3% |
| 1 | 835 | 28.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1145 | 38.6% |
| . | 990 | 33.3% |
| 1 | 835 | 28.1% |
age_encoded
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5585859 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 16 |
| Zeros (%) | 1.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1906707 |
|---|---|
| Coefficient of variation (CV) | 0.33459097 |
| Kurtosis | 0.17730439 |
| Mean | 3.5585859 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.75694884 |
| Sum | 3523 |
| Variance | 1.4176967 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 346 | 34.9% |
| 3 | 241 | 24.3% |
| 5 | 230 | 23.2% |
| 2 | 109 | 11.0% |
| 1 | 48 | 4.8% |
| 0 | 16 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 16 | 1.6% |
| 1 | 48 | 4.8% |
| 2 | 109 | 11.0% |
| 3 | 241 | 24.3% |
| 4 | 346 | 34.9% |
| 5 | 230 | 23.2% |
| Value | Count | Frequency (%) |
| 5 | 230 | 23.2% |
| 4 | 346 | 34.9% |
| 3 | 241 | 24.3% |
| 2 | 109 | 11.0% |
| 1 | 48 | 4.8% |
| 0 | 16 | 1.6% |
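
The descriptive statistics reported for age_encoded can be cross-checked from its six value counts alone. The sketch below is an illustration rather than the report's own code. It assumes the variance uses the sample divisor (n - 1), since that choice is what reproduces the reported 1.4176967, and all variable names are hypothetical.

```python
import math

# Value counts for age_encoded, copied from the frequency tables in this section.
value_counts = {0: 16, 1: 48, 2: 109, 3: 241, 4: 346, 5: 230}

n = sum(value_counts.values())                       # number of observations
total = sum(v * c for v, c in value_counts.items())  # the "Sum" row
mean = total / n

# Sample variance (divisor n - 1); assumed because it matches the reported value.
variance = sum(c * (v - mean) ** 2 for v, c in value_counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(f"n={n} sum={total} mean={mean:.7f}")
print(f"variance={variance:.7f} std={std:.7f} cv={cv:.8f}")
```

Working from counts rather than raw rows keeps the check independent of the original data file. It also makes the relationships between the rows explicit: the standard deviation is the square root of the variance, and the coefficient of variation is the standard deviation divided by the mean.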
best_response_encoded
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 990 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 519 | 52.4% |
| 2 | 212 | 21.4% |
| 3 | 199 | 20.1% |
| 0 | 60 | 6.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 519 | 52.4% |
| 2 | 212 | 21.4% |
| 3 | 199 | 20.1% |
| 0 | 60 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 990 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 519 | 52.4% |
| 2 | 212 | 21.4% |
| 3 | 199 | 20.1% |
| 0 | 60 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 990 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 519 | 52.4% |
| 2 | 212 | 21.4% |
| 3 | 199 | 20.1% |
| 0 | 60 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 990 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 519 | 52.4% |
| 2 | 212 | 21.4% |
| 3 | 199 | 20.1% |
| 0 | 60 | 6.1% |
stage_encoded
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 990 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 930 | 93.9% |
| 2 | 54 | 5.5% |
| 1 | 5 | 0.5% |
| 0 | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 930 | 93.9% |
| 2 | 54 | 5.5% |
| 1 | 5 | 0.5% |
| 0 | 1 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 990 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 930 | 93.9% |
| 2 | 54 | 5.5% |
| 1 | 5 | 0.5% |
| 0 | 1 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 990 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 930 | 93.9% |
| 2 | 54 | 5.5% |
| 1 | 5 | 0.5% |
| 0 | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 990 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 930 | 93.9% |
| 2 | 54 | 5.5% |
| 1 | 5 | 0.5% |
| 0 | 1 | 0.1% |
ECOG_encoded
Categorical
HIGH CORRELATION 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 990 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 544 | 54.9% |
| 0 | 357 | 36.1% |
| 2 | 76 | 7.7% |
| 3 | 11 | 1.1% |
| 4 | 2 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 544 | 54.9% |
| 0 | 357 | 36.1% |
| 2 | 76 | 7.7% |
| 3 | 11 | 1.1% |
| 4 | 2 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 990 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 544 | 54.9% |
| 0 | 357 | 36.1% |
| 2 | 76 | 7.7% |
| 3 | 11 | 1.1% |
| 4 | 2 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 990 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 544 | 54.9% |
| 0 | 357 | 36.1% |
| 2 | 76 | 7.7% |
| 3 | 11 | 1.1% |
| 4 | 2 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 990 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 544 | 54.9% |
| 0 | 357 | 36.1% |
| 2 | 76 | 7.7% |
| 3 | 11 | 1.1% |
| 4 | 2 | 0.2% |
msi_type_encoded
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 990 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 918 | 92.7% |
| 0 | 41 | 4.1% |
| 2 | 31 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 918 | 92.7% |
| 0 | 41 | 4.1% |
| 2 | 31 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 990 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 918 | 92.7% |
| 0 | 41 | 4.1% |
| 2 | 31 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 990 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 918 | 92.7% |
| 0 | 41 | 4.1% |
| 2 | 31 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 990 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 918 | 92.7% |
| 0 | 41 | 4.1% |
| 2 | 31 | 3.1% |
tx_line_encoded
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 990 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 667 | 67.4% |
| 0 | 323 | 32.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 667 | 67.4% |
| 0 | 323 | 32.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 990 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 667 | 67.4% |
| 0 | 323 | 32.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 990 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 667 | 67.4% |
| 0 | 323 | 32.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 990 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 667 | 67.4% |
| 0 | 323 | 32.6% |
| | ECOG | ECOG_encoded | OS | PFS | age | age_encoded | best_response | best_response_encoded | cancer_type=GU | cancer_type=LGI | cancer_type=Lung | cancer_type=Melanoma | cancer_type=UGI | clin_benefit=Yes | drug_class=ctla-4 | drug_class=pd-1/pd-l1 | id | msi_type | msi_type_encoded | nlr | progression=Yes | response=Yes | sex=Male | stage | stage_encoded | tmb_mutations_mb | tx_line | tx_line_encoded | tx_year | vital_status |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ECOG | 1.000 | 1.000 | -0.372 | -0.288 | 0.067 | 0.170 | 0.146 | 0.146 | 0.000 | 0.000 | 0.161 | 0.221 | 0.056 | 0.205 | 0.000 | 0.099 | 0.035 | 0.083 | 0.083 | 0.299 | 0.190 | 0.204 | 0.042 | 0.032 | 0.032 | 0.031 | 0.237 | 0.237 | 0.069 | 0.269 |
| ECOG_encoded | 1.000 | 1.000 | -0.372 | -0.288 | 0.067 | 0.170 | 0.146 | 0.146 | 0.000 | 0.000 | 0.161 | 0.221 | 0.056 | 0.205 | 0.000 | 0.099 | 0.035 | 0.083 | 0.083 | 0.299 | 0.190 | 0.204 | 0.042 | 0.032 | 0.032 | 0.031 | 0.237 | 0.237 | 0.069 | 0.269 |
| OS | -0.372 | -0.372 | 1.000 | 0.711 | 0.056 | -0.048 | 0.369 | 0.369 | 0.158 | 0.000 | 0.094 | 0.190 | 0.167 | 0.598 | 0.000 | 0.205 | -0.120 | 0.000 | 0.000 | -0.355 | 0.417 | 0.539 | 0.071 | 0.057 | 0.057 | 0.142 | 0.330 | 0.330 | 0.431 | 0.617 |
| PFS | -0.288 | -0.288 | 0.711 | 1.000 | 0.000 | 0.020 | 0.519 | 0.519 | 0.108 | 0.000 | 0.071 | 0.172 | 0.038 | 0.861 | 0.000 | 0.161 | -0.015 | 0.098 | 0.098 | -0.258 | 0.771 | 0.760 | 0.000 | 0.068 | 0.068 | 0.196 | 0.309 | 0.309 | 0.306 | 0.607 |
| age | 0.067 | 0.067 | 0.056 | 0.000 | 1.000 | 1.000 | 0.048 | 0.048 | 0.068 | 0.203 | 0.197 | 0.049 | 0.041 | 0.000 | 0.011 | 0.077 | 0.043 | 0.057 | 0.057 | 0.065 | 0.000 | 0.018 | 0.079 | 0.000 | 0.000 | 0.168 | 0.041 | 0.041 | 0.000 | 0.055 |
| age_encoded | 0.170 | 0.170 | -0.048 | 0.020 | 1.000 | 1.000 | 0.048 | 0.048 | 0.068 | 0.203 | 0.197 | 0.049 | 0.041 | 0.000 | 0.011 | 0.077 | 0.043 | 0.057 | 0.057 | 0.065 | 0.000 | 0.018 | 0.079 | 0.000 | 0.000 | 0.168 | 0.041 | 0.041 | 0.000 | 0.055 |
| best_response | 0.146 | 0.146 | 0.369 | 0.519 | 0.048 | 0.048 | 1.000 | 1.000 | 0.000 | 0.000 | 0.148 | 0.356 | 0.069 | 0.916 | 0.000 | 0.105 | 0.036 | 0.061 | 0.061 | -0.027 | 0.623 | 0.999 | 0.033 | 0.088 | 0.088 | 0.001 | 0.328 | 0.328 | 0.072 | 0.545 |
| best_response_encoded | 0.146 | 0.146 | 0.369 | 0.519 | 0.048 | 0.048 | 1.000 | 1.000 | 0.000 | 0.000 | 0.148 | 0.356 | 0.069 | 0.916 | 0.000 | 0.105 | 0.036 | 0.061 | 0.061 | -0.027 | 0.623 | 0.999 | 0.033 | 0.088 | 0.088 | 0.001 | 0.328 | 0.328 | 0.072 | 0.545 |
| cancer_type=GU | 0.000 | 0.000 | 0.158 | 0.108 | 0.068 | 0.068 | 0.000 | 0.000 | 1.000 | 0.090 | 0.369 | 0.155 | 0.184 | 0.000 | 0.000 | 0.034 | -0.096 | 0.054 | 0.054 | -0.046 | 0.033 | 0.000 | 0.081 | 0.091 | 0.091 | -0.129 | 0.000 | 0.000 | 0.093 | 0.000 |
| cancer_type=LGI | 0.000 | 0.000 | 0.000 | 0.000 | 0.203 | 0.203 | 0.000 | 0.000 | 0.090 | 1.000 | 0.226 | 0.089 | 0.109 | 0.000 | 0.000 | 0.000 | -0.016 | 0.372 | 0.372 | -0.005 | 0.000 | 0.000 | 0.033 | 0.000 | 0.000 | 0.121 | 0.102 | 0.102 | 0.070 | 0.000 |
| cancer_type=Lung | 0.161 | 0.161 | 0.094 | 0.071 | 0.197 | 0.197 | 0.148 | 0.148 | 0.369 | 0.226 | 1.000 | 0.366 | 0.432 | 0.062 | 0.041 | 0.087 | 0.077 | 0.123 | 0.123 | 0.157 | 0.112 | 0.074 | 0.119 | 0.041 | 0.041 | 0.068 | 0.140 | 0.140 | 0.122 | 0.093 |
| cancer_type=Melanoma | 0.221 | 0.221 | 0.190 | 0.172 | 0.049 | 0.049 | 0.356 | 0.356 | 0.155 | 0.089 | 0.366 | 1.000 | 0.182 | 0.171 | 0.000 | 0.248 | 0.020 | 0.070 | 0.070 | -0.146 | 0.216 | 0.200 | 0.095 | 0.181 | 0.181 | 0.197 | 0.491 | 0.491 | 0.127 | 0.171 |
| cancer_type=UGI | 0.056 | 0.056 | 0.167 | 0.038 | 0.041 | 0.041 | 0.069 | 0.069 | 0.184 | 0.109 | 0.432 | 0.182 | 1.000 | 0.031 | 0.000 | 0.005 | -0.012 | 0.049 | 0.049 | -0.013 | 0.078 | 0.026 | 0.068 | 0.000 | 0.000 | -0.201 | 0.126 | 0.126 | 0.164 | 0.054 |
| clin_benefit=Yes | 0.205 | 0.205 | 0.598 | 0.861 | 0.000 | 0.000 | 0.916 | 0.916 | 0.000 | 0.000 | 0.062 | 0.171 | 0.031 | 1.000 | 0.000 | 0.084 | 0.044 | 0.092 | 0.092 | -0.190 | 0.584 | 0.896 | 0.000 | 0.068 | 0.068 | 0.198 | 0.319 | 0.319 | 0.091 | 0.553 |
| drug_class=ctla-4 | 0.000 | 0.000 | 0.000 | 0.000 | 0.011 | 0.011 | 0.000 | 0.000 | 0.000 | 0.000 | 0.041 | 0.000 | 0.000 | 0.000 | 1.000 | 0.127 | -0.034 | 0.000 | 0.000 | 0.004 | 0.000 | 0.000 | 0.023 | 0.000 | 0.000 | 0.008 | 0.000 | 0.000 | 0.053 | 0.000 |
| drug_class=pd-1/pd-l1 | 0.099 | 0.099 | 0.205 | 0.161 | 0.077 | 0.077 | 0.105 | 0.105 | 0.034 | 0.000 | 0.087 | 0.248 | 0.005 | 0.084 | 0.127 | 1.000 | 0.063 | 0.062 | 0.062 | 0.040 | 0.079 | 0.101 | 0.000 | 0.056 | 0.056 | -0.012 | 0.191 | 0.191 | 0.159 | 0.031 |
| id | 0.035 | 0.035 | -0.120 | -0.015 | 0.043 | 0.043 | 0.036 | 0.036 | -0.096 | -0.016 | 0.077 | 0.020 | -0.012 | 0.044 | -0.034 | 0.063 | 1.000 | 0.038 | 0.038 | 0.026 | 0.000 | 0.000 | 0.000 | 0.035 | 0.035 | -0.001 | 0.251 | 0.251 | 0.476 | 0.077 |
| msi_type | 0.083 | 0.083 | 0.000 | 0.098 | 0.057 | 0.057 | 0.061 | 0.061 | 0.054 | 0.372 | 0.123 | 0.070 | 0.049 | 0.092 | 0.000 | 0.062 | 0.038 | 1.000 | 1.000 | -0.004 | 0.125 | 0.099 | 0.016 | 0.020 | 0.020 | 0.147 | 0.089 | 0.089 | 0.000 | 0.033 |
| msi_type_encoded | 0.083 | 0.083 | 0.000 | 0.098 | 0.057 | 0.057 | 0.061 | 0.061 | 0.054 | 0.372 | 0.123 | 0.070 | 0.049 | 0.092 | 0.000 | 0.062 | 0.038 | 1.000 | 1.000 | -0.004 | 0.125 | 0.099 | 0.016 | 0.020 | 0.020 | 0.147 | 0.089 | 0.089 | 0.000 | 0.033 |
| nlr | 0.299 | 0.299 | -0.355 | -0.258 | 0.065 | 0.065 | -0.027 | -0.027 | -0.046 | -0.005 | 0.157 | -0.146 | -0.013 | -0.190 | 0.004 | 0.040 | 0.026 | -0.004 | -0.004 | 1.000 | 0.097 | 0.123 | 0.000 | 0.000 | 0.000 | 0.033 | 0.114 | 0.114 | 0.018 | 0.207 |
| progression=Yes | 0.190 | 0.190 | 0.417 | 0.771 | 0.000 | 0.000 | 0.623 | 0.623 | 0.033 | 0.000 | 0.112 | 0.216 | 0.078 | 0.584 | 0.000 | 0.079 | 0.000 | 0.125 | 0.125 | 0.097 | 1.000 | 0.578 | 0.036 | 0.128 | 0.128 | -0.233 | 0.253 | 0.253 | 0.062 | 0.542 |
| response=Yes | 0.204 | 0.204 | 0.539 | 0.760 | 0.018 | 0.018 | 0.999 | 0.999 | 0.000 | 0.000 | 0.074 | 0.200 | 0.026 | 0.896 | 0.000 | 0.101 | 0.000 | 0.099 | 0.099 | 0.123 | 0.578 | 1.000 | 0.000 | 0.066 | 0.066 | 0.191 | 0.306 | 0.306 | 0.100 | 0.498 |
| sex=Male | 0.042 | 0.042 | 0.071 | 0.000 | 0.079 | 0.079 | 0.033 | 0.033 | 0.081 | 0.033 | 0.119 | 0.095 | 0.068 | 0.000 | 0.023 | 0.000 | 0.000 | 0.016 | 0.016 | 0.000 | 0.036 | 0.000 | 1.000 | 0.026 | 0.026 | 0.054 | 0.100 | 0.100 | 0.000 | 0.000 |
| stage | 0.032 | 0.032 | 0.057 | 0.068 | 0.000 | 0.000 | 0.088 | 0.088 | 0.091 | 0.000 | 0.041 | 0.181 | 0.000 | 0.068 | 0.000 | 0.056 | 0.035 | 0.020 | 0.020 | 0.000 | 0.128 | 0.066 | 0.026 | 1.000 | 1.000 | -0.080 | 0.128 | 0.128 | 0.000 | 0.126 |
| stage_encoded | 0.032 | 0.032 | 0.057 | 0.068 | 0.000 | 0.000 | 0.088 | 0.088 | 0.091 | 0.000 | 0.041 | 0.181 | 0.000 | 0.068 | 0.000 | 0.056 | 0.035 | 0.020 | 0.020 | 0.000 | 0.128 | 0.066 | 0.026 | 1.000 | 1.000 | -0.080 | 0.128 | 0.128 | 0.000 | 0.126 |
| tmb_mutations_mb | 0.031 | 0.031 | 0.142 | 0.196 | 0.168 | 0.168 | 0.001 | 0.001 | -0.129 | 0.121 | 0.068 | 0.197 | -0.201 | 0.198 | 0.008 | -0.012 | -0.001 | 0.147 | 0.147 | 0.033 | -0.233 | 0.191 | 0.054 | -0.080 | -0.080 | 1.000 | 0.115 | 0.115 | 0.016 | 0.123 |
| tx_line | 0.237 | 0.237 | 0.330 | 0.309 | 0.041 | 0.041 | 0.328 | 0.328 | 0.000 | 0.102 | 0.140 | 0.491 | 0.126 | 0.319 | 0.000 | 0.191 | 0.251 | 0.089 | 0.089 | 0.114 | 0.253 | 0.306 | 0.100 | 0.128 | 0.128 | 0.115 | 1.000 | 0.998 | 0.177 | 0.226 |
| tx_line_encoded | 0.237 | 0.237 | 0.330 | 0.309 | 0.041 | 0.041 | 0.328 | 0.328 | 0.000 | 0.102 | 0.140 | 0.491 | 0.126 | 0.319 | 0.000 | 0.191 | 0.251 | 0.089 | 0.089 | 0.114 | 0.253 | 0.306 | 0.100 | 0.128 | 0.128 | 0.115 | 0.998 | 1.000 | 0.177 | 0.226 |
| tx_year | 0.069 | 0.069 | 0.431 | 0.306 | 0.000 | 0.000 | 0.072 | 0.072 | 0.093 | 0.070 | 0.122 | 0.127 | 0.164 | 0.091 | 0.053 | 0.159 | 0.476 | 0.000 | 0.000 | 0.018 | 0.062 | 0.100 | 0.000 | 0.000 | 0.000 | 0.016 | 0.177 | 0.177 | 1.000 | 0.103 |
| vital_status | 0.269 | 0.269 | 0.617 | 0.607 | 0.055 | 0.055 | 0.545 | 0.545 | 0.000 | 0.000 | 0.093 | 0.171 | 0.054 | 0.553 | 0.000 | 0.031 | 0.077 | 0.033 | 0.033 | 0.207 | 0.542 | 0.498 | 0.000 | 0.126 | 0.126 | 0.123 | 0.226 | 0.226 | 0.103 | 1.000 |
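
The matrix above can be scanned for strongly associated pairs instead of being read cell by cell. The sketch below uses a small excerpt of it (OS, PFS, tx_year and vital_status) and a cutoff of 0.5. The cutoff is an assumption, since this report does not state the threshold behind its "High correlation" alerts, and `high_correlation_pairs` is a hypothetical helper name.

```python
# Excerpt of the correlation matrix above, upper triangle only.
corr = {
    ("OS", "PFS"): 0.711,
    ("OS", "vital_status"): 0.617,
    ("OS", "tx_year"): 0.431,
    ("PFS", "vital_status"): 0.607,
    ("PFS", "tx_year"): 0.306,
    ("tx_year", "vital_status"): 0.103,
}

THRESHOLD = 0.5  # assumed cutoff; not stated in this report


def high_correlation_pairs(pairs, threshold):
    """Return (a, b, r) for every pair with |r| >= threshold, strongest first."""
    hits = [(a, b, r) for (a, b), r in pairs.items() if abs(r) >= threshold]
    return sorted(hits, key=lambda item: abs(item[2]), reverse=True)


flagged = high_correlation_pairs(corr, THRESHOLD)
for a, b, r in flagged:
    print(f"{a} ~ {b}: {r:+.3f}")
```

Comparing on the absolute value matters here because the matrix also contains negative entries, such as ECOG against OS at -0.372, which would otherwise never be flagged however strong they were.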
| | id | tx_year | age | nlr | msi_type | tmb_mutations_mb | best_response | PFS | vital_status | OS | stage | tx_line | ECOG | response=Yes | clin_benefit=Yes | cancer_type=GU | cancer_type=LGI | cancer_type=Lung | cancer_type=Melanoma | cancer_type=UGI | sex=Male | drug_class=ctla-4 | drug_class=pd-1/pd-l1 | progression=Yes | age_encoded | best_response_encoded | stage_encoded | ECOG_encoded | msi_type_encoded | tx_line_encoded |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 8215 | 2018 | 61 - 70 | 1.38 | Stable | 19.3 | PR | 3.45 | 0.0 | 9.59 | IV | Subsequent-line | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 4 | 2 | 3 | 0 | 1 | 1 |
| 3 | 8216 | 2015 | 51 - 60 | 2.69 | Stable | 1.0 | PD | 0.56 | 0.0 | 50.14 | IV | First-line | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 3 | 1 | 3 | 0 | 1 | 0 |
| 4 | 8217 | 2018 | 51 - 60 | 2.54 | Stable | 10.5 | PR | 4.67 | 0.0 | 9.99 | IV | Subsequent-line | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 3 | 2 | 3 | 0 | 1 | 1 |
| 6 | 8219 | 2017 | 51 - 60 | 5.21 | Stable | 0.0 | PR | 8.61 | 1.0 | 18.83 | IV | First-line | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 0.0 | 1.0 | 1.0 | 3 | 2 | 3 | 0 | 1 | 0 |
| 8 | 8221 | 2016 | 61 - 70 | 2.18 | Indeterminate | 2.0 | PD | 1.25 | 1.0 | 2.33 | IV | Subsequent-line | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 1.0 | 4 | 1 | 3 | 1 | 0 | 1 |
| 9 | 8222 | 2017 | 51 - 60 | 4.62 | Stable | 4.4 | PD | 1.97 | 1.0 | 3.12 | IV | Subsequent-line | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 3 | 1 | 3 | 1 | 1 | 1 |
| 11 | 8223 | 2017 | 61 - 70 | 2.56 | Indeterminate | 41.3 | SD | 7.13 | 1.0 | 7.13 | IV | Subsequent-line | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 4 | 3 | 3 | 0 | 0 | 1 |
| 14 | 8226 | 2018 | 61 - 70 | 4.89 | Stable | 3.5 | PD | 1.35 | 0.0 | 3.22 | IV | Subsequent-line | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 4 | 1 | 3 | 1 | 1 | 1 |
| 17 | 8229 | 2015 | 51 - 60 | 2.92 | Stable | 19.7 | CR | 45.96 | 0.0 | 48.82 | IV | Subsequent-line | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 3 | 0 | 3 | 1 | 1 | 1 |
| 18 | 8230 | 2017 | 61 - 70 | 4.25 | Stable | 8.8 | PD | 1.64 | 1.0 | 6.77 | IV | Subsequent-line | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 4 | 1 | 3 | 1 | 1 | 1 |
| | id | tx_year | age | nlr | msi_type | tmb_mutations_mb | best_response | PFS | vital_status | OS | stage | tx_line | ECOG | response=Yes | clin_benefit=Yes | cancer_type=GU | cancer_type=LGI | cancer_type=Lung | cancer_type=Melanoma | cancer_type=UGI | sex=Male | drug_class=ctla-4 | drug_class=pd-1/pd-l1 | progression=Yes | age_encoded | best_response_encoded | stage_encoded | ECOG_encoded | msi_type_encoded | tx_line_encoded |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1617 | 10016 | 2018 | 51 - 60 | 12.30 | Stable | 2.6 | SD | 2.46 | 1.0 | 6.08 | IV | First-line | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 3 | 3 | 3 | 0 | 1 | 0 |
| 1618 | 10017 | 2018 | 71 - 95 | 19.88 | Unstable | 47.4 | SD | 1.74 | 1.0 | 1.74 | IV | First-line | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 5 | 3 | 3 | 1 | 2 | 0 |
| 1621 | 10020 | 2018 | 61 - 70 | 9.45 | Stable | 17.6 | PD | 0.89 | 1.0 | 3.45 | IV | First-line | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 4 | 1 | 3 | 0 | 1 | 0 |
| 1622 | 10021 | 2018 | 61 - 70 | 5.92 | Stable | 3.5 | CR | 10.91 | 0.0 | 13.83 | IV | First-line | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | 0.0 | 1.0 | 1.0 | 4 | 0 | 3 | 1 | 1 | 0 |
| 1623 | 10023 | 2018 | 61 - 70 | 1.86 | Stable | 7.9 | PR | 12.45 | 0.0 | 15.90 | IV | First-line | 0.0 | 1.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 4 | 2 | 3 | 0 | 1 | 0 |
| 1624 | 10025 | 2018 | 51 - 60 | 9.25 | Stable | 0.0 | PD | 0.82 | 1.0 | 0.82 | IV | First-line | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 3 | 1 | 3 | 1 | 1 | 0 |
| 1626 | 10027 | 2018 | 61 - 70 | 6.50 | Stable | 12.3 | SD | 11.70 | 0.0 | 12.22 | III | Subsequent-line | 1.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 0.0 | 4 | 3 | 2 | 1 | 1 | 1 |
| 1627 | 10028 | 2018 | 71 - 95 | 4.82 | Stable | 9.7 | PD | 1.54 | 1.0 | 1.54 | IV | First-line | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 5 | 1 | 3 | 1 | 1 | 0 |
| 1629 | 10030 | 2018 | 51 - 60 | 4.92 | Stable | 3.5 | SD | 8.08 | 0.0 | 14.95 | IV | First-line | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 3 | 3 | 3 | 0 | 1 | 0 |
| 1630 | 10032 | 2018 | 41 - 50 | 34.67 | Stable | 9.7 | PD | 1.38 | 1.0 | 3.15 | IV | Subsequent-line | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | 1.0 | 1.0 | 2 | 1 | 3 | 1 | 1 | 1 |